Goto

Collaborating Authors

 Erie


Machine Learning Enabled Early Warning System For Financial Distress Using Real-Time Digital Signals

pant, Laxmi, Reza, Syed Ali, Rahman, Md Khalilor, Rahman, MD Saifur, Sharmin, Shamima, Mithu, Md Fazlul Huq, Hasnain, Kazi Nehal, Farabi, Adnan, khanom, Mahamuda, Kabir, Raisul

arXiv.org Artificial Intelligence

International Journal of Applied Mathematics Volume 38 No. 5 s, 2025 ISSN: 1311 - 1728 (printed version); ISSN: 1314 - 8060 (on - line version) Received: August 0 7, 2025 550 Abstract The growing instability of both global and domestic economic environments has increased the risk of financial distress at the household level. However, traditional econometric models often rely on delayed and aggregated data, limiting their effectiveness. This study introduces a machine learning - based early warning system that utilizes real - time digital and macroeconomic signals to identify financial distress in near real - time. Using a panel dataset of 750 households tracked over three monitoring rounds spa nning 13 months, the framework combines socioeconomic attributes, macroeconomic indicators (such as GDP growth, inflation, and foreign exchange fluctuations), and digital economy measures (including ICT demand and market volatility). Through data preproces sing and feature engineering, we introduce lagged variables, volatility measures, and interaction terms to capture both gradual and sudden changes in financial stability. We benchmark baseline classifiers, such as logistic regression and decision trees, ag ainst advanced ensemble models including random forests, XGBoost, and LightGBM. Our results indicate that the engineered features from the digital economy significantly enhance predictive accuracy. The system performs reliably for both binary distress dete ction and multi - class severity classification, with SHAP - based explanations identifying inflation volatility and ICT demand as key predictors. Crucially, the framework is International Journal of Applied Mathematics Volume 38 No. 5 s, 2025 ISSN: 1311 - 1728 (printed version); ISSN: 1314 - 8060 (on - line version) Received: August 0 7, 2025 551 By implementing machine learning in a transparent and interpretable manner, this study demonstrates the feasibility and impact of providing near - real - time early warnings of financial distress. This offers actionable insights that can strengthen household resilience and guide preemptive intervention strategies. Keywords: Financial Distress, Early Warning Systems, Machine Learning, Digital Economy, Temporal Classification, Explainable AI 1. Introduction 1.1 Background and Motivation The prediction of financial distress has long been recognized as a critical element for ensuring economic resilience and mitigating systemic risk across households, firms, and national economies.


Meteorologist's stark warning to Americans to brace for a harsh winter with less snow but more nor'easters

Daily Mail - Science & tech

'Four dead and 12 injured' in Mississippi shooting after people descend on town for homecoming game Joe Biden, 82, receiving new treatment after'aggressive' cancer spread to his bones REVEALED: The secret George Soros network'behind America's street chaos'... and the dossier that shows how to stop it Tinnitus destroyed Peter's life but doctors dismissed him. Then he tried an extraordinary drug-free University of Cambridge-backed treatment that gives instant relief - no wonder medics say it's so'exciting' KENNEDY: Obama's bitter post about Trump's Gaza peace deal proves what I've long suspected about Barry... and it would make Sigmund Freud blush Gold is soaring... here's what the pros say you should do with your 401(k) before it's too late Model dubbed'the world's most beautiful girl' when she was six is now all grown up and looks VERY different as she poses up a storm at Paris Fashion Week Teacher was'so high on cocaine she thought one of her students was her dog' But now, a royal insider claims they're'just as entitled as their parents' with'shady friends' Heartbreaking moment NFL reporter makes brutal comment about player Xavier Legette's dead father in locker room interview Experts reveal the surprising TRUTH behind RFK Jr's link between circumcision and autism Bombshell records that damn Letitia James and show Trump was RIGHT... and the staggering sum she was swindling Trump starts DOGE 2.0 as mass layoffs take place across federal government amid shutdown Famed'Big Short' investor gives terrifying verdict on Trump hammering China with 100 PERCENT tariff... and issues doomsday warning to Wall Street Jennifer Aniston, you've betrayed every woman with your selfish admission about not having children: CAROLINE BULLOCK Meteorologist's stark warning to Americans to brace for a harsh winter with less snow but more nor'easters Meteorologists are already predicting what the winter months will bring, with some regions of the US expected to see less snow than last year, and nor'easters anticipated to ravage parts of the Northeast. Paul Pastelok, chief meteorologist for AccuWeather's long-range forecasting team, told the Daily Mail that while he didn't expect above normal snowfall for the winter season, he warned that those in the Northeast should brace for nor'easters and it would still be a harsh winter. Pastelok explained that the nor'easter over this weekend is on trend with what is to come, as rapidly developing storms come in off the East Coast. 'People may say, Well, you're forecasting less snow, so it doesn't look like a harsh winter.


Cryptocurrency Price Forecasting Using Machine Learning: Building Intelligent Financial Prediction Models

Islam, Md Zahidul, Rahman, Md Shafiqur, Sumsuzoha, Md, Sarker, Babul, Islam, Md Rafiqul, Alam, Mahfuz, Shil, Sanjib Kumar

arXiv.org Artificial Intelligence

Cryptocurrency markets are experiencing rapid growth, but this expansion comes with significant challenges, particularly in predicting cryptocurrency prices for traders in the U.S. In this study, we explore how deep learning and machine learning models can be used to forecast the closing prices of the XRP/USDT trading pair. While many existing cryptocurrency prediction models focus solely on price and volume patterns, they often overlook market liquidity, a crucial factor in price predictability. To address this, we introduce two important liquidity proxy metrics: the Volume-To-Volatility Ratio (VVR) and the Volume-Weighted Average Price (VWAP). These metrics provide a clearer understanding of market stability and liquidity, ultimately enhancing the accuracy of our price predictions. We developed four machine learning models, Linear Regression, Random Forest, XGBoost, and LSTM neural networks, using historical data without incorporating the liquidity proxy metrics, and evaluated their performance. We then retrained the models, including the liquidity proxy metrics, and reassessed their performance. In both cases (with and without the liquidity proxies), the LSTM model consistently outperformed the others. These results underscore the importance of considering market liquidity when predicting cryptocurrency closing prices. Therefore, incorporating these liquidity metrics is essential for more accurate forecasting models. Our findings offer valuable insights for traders and developers seeking to create smarter and more risk-aware strategies in the U.S. digital assets market.


Secure Energy Transactions Using Blockchain Leveraging AI for Fraud Detection and Energy Market Stability

Khan, Md Asif Ul Hoq, Islam, MD Zahedul, Ahmed, Istiaq, Rabbi, Md Masud Karim, Anonna, Farhana Rahman, Zeeshan, MD Abdul Fahim, Ridoy, Mehedi Hasan, Chowdhury, Bivash Ranjan, Rabbi, Md Nazmul Shakir, Sadnan, GM Alamin

arXiv.org Artificial Intelligence

Peer-to-peer trading and the move to decentralized grids have reshaped the energy markets in the United States. Notwithstanding, such developments lead to new challenges, mainly regarding the safety and authenticity of energy trade. This study aimed to develop and build a secure, intelligent, and efficient energy transaction system for the decentralized US energy market. This research interlinks the technological prowess of blockchain and artificial intelligence (AI) in a novel way to solve long-standing challenges in the distributed energy market, specifically those of security, fraudulent behavior detection, and market reliability. The dataset for this research is comprised of more than 1.2 million anonymized energy transaction records from a simulated peer-to-peer (P2P) energy exchange network emulating real-life blockchain-based American microgrids, including those tested by LO3 Energy and Grid+ Labs. Each record contains detailed fields of transaction identifier, timestamp, energy volume (kWh), transaction type (buy/sell), unit price, prosumer/consumer identifier (hashed for privacy), smart meter readings, geolocation regions, and settlement confirmation status. The dataset also includes system-calculated behavior metrics of transaction rate, variability of energy production, and historical pricing patterns. The system architecture proposed involves the integration of two layers, namely a blockchain layer and artificial intelligence (AI) layer, each playing a unique but complementary function in energy transaction securing and market intelligence improvement. The machine learning models used in this research were specifically chosen for their established high performance in classification tasks, specifically in the identification of energy transaction fraud in decentralized markets.


Advanced fraud detection using machine learning models: enhancing financial transaction security

Fariha, Nudrat, Khan, Md Nazmuddin Moin, Hossain, Md Iqbal, Reza, Syed Ali, Bortty, Joy Chakra, Sultana, Kazi Sharmin, Jawad, Md Shadidur Islam, Safat, Saniah, Ahad, Md Abdul, Begum, Maksuda

arXiv.org Artificial Intelligence

The rise of digital payments has accelerated the need for intelligent and scalable systems to detect fraud. This research presents an end-to-end, feature-rich machine learning framework for detecting credit card transaction anomalies and fraud using real-world data. The study begins by merging transactional, cardholder, merchant, and merchant category datasets from a relational database to create a unified analytical view. Through the feature engineering process, we extract behavioural signals such as average spending, deviation from historical patterns, transaction timing irregularities, and category frequency metrics. These features are enriched with temporal markers such as hour, day of week, and weekend indicators to expose all latent patterns that indicate fraudulent behaviours. Exploratory data analysis reveals contextual transaction trends across all the dataset features. Using the transactional data, we train and evaluate a range of unsupervised models: Isolation Forest, One Class SVM, and a deep autoencoder trained to reconstruct normal behavior. These models flag the top 1% of reconstruction errors as outliers. PCA visualizations illustrate each models ability to separate anomalies into a two-dimensional latent space. We further segment the transaction landscape using K-Means clustering and DBSCAN to identify dense clusters of normal activity and isolate sparse, suspicious regions.


Enhanced semi-supervised stamping process monitoring with physically-informed feature extraction

Zhang, Jianyu

arXiv.org Artificial Intelligence

In tackling frequent batch anomalies in high-speed stamping processes, this study introduces a novel semi-supervised in-process anomaly monitoring framework, utilizing accelerometer signals and physics information, to capture the process anomaly effectively. The proposed framework facilitates the construction of a monitoring model with imbalanced sample distribution, which enables in-process condition monitoring in real-time to prevent batch anomalies, which helps to reduce batch defects risk and enhance production yield. Firstly, to effectively capture key features from raw data containing redundant information, a hybrid feature extraction algorithm is proposed to utilize data-driven methods and physical mechanisms simultaneously. Secondly, to address the challenge brought by imbalanced sample distribution, a semi-supervised anomaly detection model is established, which merely employs normal samples to build a golden baseline model, and a novel deviation score is proposed to quantify the anomaly level of each online stamping stroke. The effectiveness of the proposed feature extraction method is validated with various classification algorithms. A real-world in-process dataset from stamping manufacturing workshop is employed to illustrate the superiority of proposed semi-supervised framework with enhance performance for process anomaly monitoring.


Designing and Deploying AI Models for Sustainable Logistics Optimization: A Case Study on Eco-Efficient Supply Chains in the USA

Shawon, Reza E Rabbi, Hasan, MD Rokibul, Rahman, Md Anisur, Ghandri, Mohamed, Lamari, Iman Ahmed, Kawsar, Mohammed, Akter, Rubi

arXiv.org Artificial Intelligence

The rapid evolution of Artificial Intelligence (AI) and Machine Learning (ML) has significantly transformed logistics and supply chain management, particularly in the pursuit of sustainability and eco-efficiency. This study explores AI-based methodologies for optimizing logistics operations in the USA, focusing on reducing environmental impact, improving fuel efficiency, and minimizing costs. Key AI applications include predictive analytics for demand forecasting, route optimization through machine learning, and AI-powered fuel efficiency strategies. Various models, such as Linear Regression, XGBoost, Support Vector Machine, and Neural Networks, are applied to real-world logistics datasets to reduce carbon emissions based on logistics operations, optimize travel routes to minimize distance and travel time, and predict future deliveries to plan optimal routes. Other models such as K-Means and DBSCAN are also used to optimize travel routes to minimize distance and travel time for logistics operations. This study utilizes datasets from logistics companies' databases. The study also assesses model performance using metrics such as mean absolute error (MAE), mean squared error (MSE), and R2 score. This study also explores how these models can be deployed to various platforms for real-time logistics and supply chain use. The models are also examined through a thorough case study, highlighting best practices and regulatory frameworks that promote sustainability. The findings demonstrate AI's potential to enhance logistics efficiency, reduce carbon footprints, and contribute to a more resilient and adaptive supply chain ecosystem.


Toxicity of the Commons: Curating Open-Source Pre-Training Data

Arnett, Catherine, Jones, Eliot, Yamshchikov, Ivan P., Langlais, Pierre-Carl

arXiv.org Artificial Intelligence

Open-source large language models are becoming increasingly available and popular among researchers and practitioners. While significant progress has been made on open-weight models, open training data is a practice yet to be adopted by the leading open-weight models creators. At the same time, there researchers are working to make language models safer. We propose a data curation pipeline to reduce harmful outputs by models trained on public domain data. There are unique challenges to working with public domain data, as these sources differ from web text in both form and content. Many sources are historical documents and are the result of Optical Character Recognition (OCR). Consequently, current state-of-the-art approaches to toxicity filtering are often infeasible or inappropriate for open data models. In this paper, we introduce a new fully open-source pipeline for open-data toxicity filtering. Our contributions are threefold. We create a custom training dataset, ToxicCommons, which is composed of texts which have been classified across five different dimensions (racial/origin-based, gender/sex-based, religious, ability-based discrimination, and violence). We use this dataset to train a custom classifier, Celadon, that can be used to detect toxic content in open data more efficiently at a larger scale. Finally, we describe the balanced approach to content filtration that optimizes safety filtering with respect to the filtered data available for training.


Automating Patch Set Generation from Code Review Comments Using Large Language Models

Rahman, Tajmilur, Singh, Rahul, Sultan, Mir Yousuf

arXiv.org Artificial Intelligence

The advent of Large Language Models (LLMs) has revolutionized various domains of artificial intelligence, including the realm of software engineering. In this research, we evaluate the efficacy of pre-trained LLMs in replicating the tasks traditionally performed by developers in response to code review comments. We provide code contexts to five popular LLMs and obtain the suggested code-changes (patch sets) derived from real-world code-review comments. The performance of each model is meticulously assessed by comparing their generated patch sets against the historical data of human-generated patch-sets from the same repositories. This comparative analysis aims to determine the accuracy, relevance, and depth of the LLMs' feedback, thereby evaluating their readiness to support developers in responding to code-review comments. Novelty: This particular research area is still immature requiring a substantial amount of studies yet to be done. No prior research has compared the performance of existing Large Language Models (LLMs) in code-review comments. This in-progress study assesses current LLMs in code review and paves the way for future advancements in automated code quality assurance, reducing context-switching overhead due to interruptions from code change requests.


Parkinson's Disease Detection through Vocal Biomarkers and Advanced Machine Learning Algorithms

Sayed, Md Abu, Tayaba, Maliha, Islam, MD Tanvir, Pavel, Md Eyasin Ul Islam, Mia, Md Tuhin, Ayon, Eftekhar Hossain, Nob, Nur, Ghosh, Bishnu Padh

arXiv.org Artificial Intelligence

Parkinson's disease (PD) is a prevalent neurodegenerative disorder known for its impact on motor neurons, causing symptoms like tremors, stiffness, and gait difficulties. This study explores the potential of vocal feature alterations in PD patients as a means of early disease prediction. This research aims to predict the onset of Parkinson's disease. Utilizing a variety of advanced machine-learning algorithms, including XGBoost, LightGBM, Bagging, AdaBoost, and Support Vector Machine, among others, the study evaluates the predictive performance of these models using metrics such as accuracy, area under the curve (AUC), sensitivity, and specificity. The findings of this comprehensive analysis highlight LightGBM as the most effective model, achieving an impressive accuracy rate of 96% alongside a matching AUC of 96%. LightGBM exhibited a remarkable sensitivity of 100% and specificity of 94.43%, surpassing other machine learning algorithms in accuracy and AUC scores. Given the complexities of Parkinson's disease and its challenges in early diagnosis, this study underscores the significance of leveraging vocal biomarkers coupled with advanced machine-learning techniques for precise and timely PD detection.